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1 .  INTRODUCTION. 


Let  x(}j  <  x(2)  <  •••  <  x(n)  orc*er  statistics  of  a 

random  sample  from  a  continuous  distribution  G(x)  with  location  and  scale 
parameters  a  and  B  *,  thus  G(x)  =  F(w)  where  F(w)  is  a  completely 
specified  distribution,  x  =  a  +  Bw  ,  and  =  a  +  Bv^j,  i  =  l,i..,n  . 

Let  nn  =  E{W|. j}  where  E  denotes  expectation.  Suppose,  to  fix  ideas, 

that  G(x)  is  the  exponential  distribution  G(x)  =  1  -  exp(-x/3),  so  that 

a  is  zero,  and  suppose  B  is  unknown.  A  well-known  technique  exists  to 

transform  the  x, .  ■>  to  a  set  Z/.>  ,  which  will  be  distributed  as  a  set  of 
u)  (i)  ’ 

uniform  order  statistics. 

The  transformation,  first  introduced  by  Sukhatme  (1937)  depends 
on  normalized  spacings  y^  ,  defined  as  follows: 

yi  =  {x(i)  “  x(i-l)}//(mi  "  “i-l^’  1  =  (!) 

where  x^  =  m^  =  0  .  For  an  exponential  set  x  ,  from  G(x),  the  values 
y^  will  be  a  random  sample  from  the  same  distribution.  A  further  trans¬ 
formation  J  gives  values  ZQ)>  as  follows: 

Z(i)  =  Ej=l  yj/EJ=l  yj  ’  1  =  1,****n  *  (2) 

It  is  well  known  that  the  values  i  3  l,...,n-l  ,  are  distributed 

as  the  order  statistics  of  a  sample  of  size  n  -  1  from  the  uniform 
distribution,  with  limits  0  and  1  ,  written  U(0,l).  A  test  of  the 
null  hypothesis  that  the  x^  are  from  G(x)  can  then  be  made  by  testing 
that  the  z,.  •.  are  (ordered)  uniform  random  variables.  Seshadri ,  Csorgo 


and  Stephens  (1969)  defined  the  K  transformation  to  be  the  combination 
of  the  Sukhatme  transformation,  say  N  ,  and  the  J  transformation  above, 
to  produce  from  the  original  x-set ;  symbolically,  jz  =  Kx  =  JNx  , 

where  z  ,  x  are  the  vectors  of  the  z,.N  and  x,  .  .  These  authors 

~  ~  ( 1 )  ( 1 ) 

2 

found  that  the  Anderson- Darling  statistic  A  ,  applied  to  the  2(jj> 
provided  a  powerful  test  for  exponent iality  of  the  original  sample  x  . 

O'Reilly  and  Stephens  (1982)  gave  characterization  properties  in  support  of 
transformation  K  ,  as  well  as  further  power  studies. 

An  important  property  of  normalized  spacings  is  that,  in  the 

limit,  for  any  regular  parent  population  for  x  ,  and  for  "sufficiently 

separate"  indices  k  and  l  ,  y^  and  y^  converge  to  independent 

exponentials,  as  k,  l,  n  -*•  00  ,  and  k/n  -*■  p.  and  £/n  ■+  p.  ,  with  both  p. 

^  J  ^ 

and  Pj  in  (0,l)  and  different;  see  Pyke  (1965,  p.  **07)  for  more  rigour 

and  details.  Thus  it  is  attractive  to  devise  tests  for  various  distributions 

based  on  their  normalized  spacings  and  on  the  values  z(jj*  A  &°°d 

feature  of  the  tests  is  that ,  although  it  is  necessary  to  know  the  values 

,  it  is  not  necessary  to  estimate  unknown  parameters:  furthermore,  the 
tests  may  be  easily  applied  to  both  left-  and  right-censored  data.  On  the 
other  hand,  it  is  not  easy  to  find  distribution  theory  for  test  statistics; 
in  particular,  despite  the  above  result,  it  is  not  correct  to  regard  the 
z^^  as  ordered  iid  uniforms,  even  asymptotically.  In  this  article  we 
discuss  asymptotic  theory  for  the  mean,  the  median,  and  the  Anderson-Darling 
statistic,  in  particular  for  tests  for  the  normal, logistic,  and  extreme-value 
(or  Weibull)  distributions,  and  follow  up  the  normal  tests  with  a  power 


Censored  data.  Normalized  spacings  can  be  used  for  the  more  general 


problems  where  the  data  is  singly-  or  doubly-censored.  For  doubly- 
censored  samples,  or  for  samples  with  no  endpoints,  the  normalized 
spacings  are  defined  as  follows.  Suppose  there  are  r  +  2  ordered 
observations  available,  namely  x(k)»x(k+i) . x(k+r+l)’  tlien  define 


yi  =  {x(k+i)  '  x(k+i-D}/(\+i  -  Vn-i5’  1  =  1’”-’r+1 


(3) 


For  singly-censored  samples,  and  with  a  known  endpoint,  an  extra  spacing 
can  arise.  If  the  distribution  has  a  known  lower  endpoint  A  ,  and  if 
the  x-sample  is  right-censored  only,  the  first  spacing  is 

yi  =  ^X(l)  ~  A^^mi  “  A^»  8X1,1  yi+i»  1  =  now  be  given  by 

the  right-hand  side  of  (3).  Similarly,  if  the  distribution  of  x  has  a 

known  upper  end-point  B  ,  and  if  the  sample  is  left-censored  only,  y^ 
will  be  given  by  (3),  for  i  =  l,...,r+l,  and  the  added  last  spacing  is 

yr+2  =  {B  -  x(k+r+1)}/{B  -  m(k+r+l)^'  For  either  r  +  2  or  r  +  1  values 

of  y^  ,  the  J-transformation  can  be  applied  to  give  r  +  1  or  r 

values  z for  the  rest  of  this  article  we  shall  assume  there  are 

(i)  ’ 

r  +  1  values  of  y^  .  Then  transformation  J  gives  r  values  z(^): 


z 


(i) 


i  r+1 
E  yj  £  Yz  ,  i 
J-l  J=1 


1 ,  •  • . ,  r 


(«*) 


If  the  original  x  were  exponentials,  the  y  are  exponential,  and  the 
z^j  are  again  ordered  uniforms.  For  other  parent  populations,  the  z^ 


are  not  ordered  uniforms,  but  statistics  used  for  tests  of  uniformity  can 


still  be  used.  We  investigate  tests  based  on  the  Anderson-Darling  statistic 
2 

A  ,  calculated  from 


=  -  r  -  (1/r)  (  I  (2i-l ) [ log  z 
i=l 


(i) 


+  l0gU  -  Z(r+l-i)}]: 


(5) 


where  log  x  refers  to  natural  logarithm;  also  two  tests  based  on  the  median 
and  the  mean  of  the  z(£)’  These  test  statistics  are  M  and  T  given  by 


M  = 


r'S[z 


({ r+1 }/2) 


-  il 


r  odd 


-T*[z  “  (r+2) /{2(r+l) }  ]  r  even 

((r+2}/2) 


T  = 


where 


z 


i 


/r  . 


These  statistics  are  investigated  because  they  are  closely  related  to 
statistic  S  ,  introduced  by  Mann,  Scheuer  and  Fertig  (1973)  and  statistic 
S*  ,  introduced  by  Tiku  and  Singh  (l98l),  for  tests  for  the  extreme-value 
and  Weibull  distributions.  When  the  x  set  is  from  an  extreme-value 
distribution  the  statistic  S  is  the  same  as  1  -  z(t)»  where  t  =  (r+l)/2 
when  r  is  odd  and  t  =  (r+2)/2  when  r  is  even;  hence  M  =  r  (0.5  -  S) 
for  r  odd  and  M  =  r  [r/{2(r+l}}  -  S}  for  r  even.  Statistic  S*  is  2z  . 
These  statistics  were  found,  by  the  authors  above,  to  give  powerful  tests 
for  the  extreme- value  distribution  (or  for  the  2-parameter  Weibull 
distribution  by  taking  logarithms  of  the  data)  against  certain  types  of 
alternative . 

In  order  to  calculate  the  statistics,  values  of  el  (or,  more 
precisely,  values  of  the  difference  ^  *  itl  -  el  ^)  are  needed.  For  the 


normal  distribution  extensive  tables  can  be  found  in  Harter  (l96l),  and 
are  reproduced  in  Biometrika  Tables  for  Statisticians,  Vol.  2;  also  computer 

routines  exist  to  calculate  the  nr  very  accurately.  For  the  extreme- 
value  distribution  tables  of  k^  are  given  for  3  S  n  5  25  by  Mann, 

Scheuer  and  Fertig  (1973).  For  the  logistic  distribution 
k^  =  n/{(i-l) (n-i+l) } ,  i  =  2,...,n  . 

In  this  article  we  give  general  asymptotic  theory  for  the  above 
test  statistics,  and  apply  the  theory  to  tests  for  the  normal,  logistic 
and  extreme-value  distributions.  Significance  points  are  given  for  the 
normal  and  logistic  tests,  and  the  power  of  the  normal  tests  is  investigated 
in  detail.  Percentage  points  and  power  results  for  tests  for  the  extreme- 
value  distribution  are  recorded  elsewhere  (Lockhart,  O'Reilly  and  Stephens, 


2.  ASYMPTOTIC  THEORY  OF  THE  TESTS. 
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The  statistics  A  ,  M  and  T  are  functionals  of  the  quantile 

process  Q  ( t )  of  the  z.  ,  where 
n  i 

Qn(t)  =  Az^j-t),  0  S  t  5  1  , 

(here  v  is  the  greatest  integer  in  (r+l)t,  and  =0  and  2(r+1) 

by  definition) ,  and  of  the  empirical  process 

Rn(t)  =  r^Kr'1  I(z.  <t)-t]  0  <  t  <  1  . 

Here  l(B)  is  the  indicator  function;  1(B)  =  1  if  event  B  occurs, 
and  1(B)  =  0  otherwise.  Specifically,  it  may  be  shown  that 

M  =  Qn(4)  +  op(l)  ; 

1 

T=  Q(t)dt+o(l)  ;  and 
J  n  p 

0 

1 

A2  =  |  R2( s )ds/ {s ( 1-s ) } . 

0 

Suppose  distribution  F  has  density  f  with  derivative  f  . 

Define 

c(x)  =  -  (1  +  (l-x)f(F"1(x))/f2(F"1(x)))  , 


and  set 


-  8  - 


s 

I1(s)  =  |  (l  +  uc(u) )/(l-u)du  , 
0 
f 

I2(s)  =  J  c(u)l1(u)du  . 


t 

and  Ig(s,t)  =  |  c(x)dx  . 
s 

Set  pQ(t,s)  =  pQ(s,t)  =  s  +  2  I2(s)  +  I^sjl^s.t)  0  5  s  5  t  5  1  .  (6) 

Finally  if  0  5  p  <  q  5  1  and  0  5  s  ,t  5  1  set  t*  =  p  +  t(q-p) 
and  s*  =  p  +  s(q-p)  and  let 

p(s,t)  =  (q-p)"1  {pQ(t*,s*)  -  spQ(t*,q)  -  (l-s)p0(t*,p) 

-  tpQ(s*,q)  -  (1-t )pQ(s*,p)  +  stpQ(q,q)  (7) 

-  (l-s)(l-t)pQ(p,p)  +  ( s  +t-2st ) Pq ( p ,q ) } . 


To  simplify  notation  we  shall  sometimes  omit  the  arguments  of,  for 

example,  Qn(t),  and  of  p(s,t).  The  asymptotic  theory  of  the  test  statistics 

is  based  on  the  following  conjecture: 

Under  and  some  regularity  conditions,  as  n  -*■  00  ,  k/n  p  and  (k+r+l)/n  -*•  q. 


(i) 

Qn 

converges  weakly  in 

D[0,1] 

to  a  Gaussian  process  Q  with  mean 

zero  and  covariance  p  , 

• 

(ii) 

R 

n 

converges  weakly  to 

R  =  -  Q 

,  and 

(iii ) 

A2 

converges  in  distribution  to 

1 

1  R2(s ) / {s ( 1-s ) }ds  . 

0 
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If  the  conjecture  is  correct  then  A  is  distributed  asymptotically 
as  £  where  ^  ?  Aj  ;  . . .  are  the  eigenvalues  of 


Af(x)  =  |  f(t )p*{ s ,t )dt 


1  -  <•} 

with  p*(s,t)  =  p(s,t)/{(s(l-s)t(l-t )} 2,  and  un  are  independent 

V  2  Q  2 

variates.  Moreover,  T  -*■  N(0,aT)  and  M  -*■  N(0,o,^)  . 


Where 


J.  ± 

=  j  j  p(s  ,t )  dsdt  and  0^  =  . 


The  argument  leading  to  the  conjecture  is  as  follows.  First  the 

weak  convergence  of  Qn  implies  that  of  by  a  standard  Skorohod 

construction  argument.  Similarly  weak  convergence  of  Q  follows 

n 

immediately  from  weak  convergence  (under  a  =  0,  8  =  l)  of 
_1  [nt] 

H  (t )  =  n  (n  £  Y  -  t}  to  a  Gaussian  process  n  with  mean  zero  and 
j=l  J 

covariance  p_  .  The  process  Q  is  then  Q ( t )  =  ( q— p)  *[ rj fp+t ( q-p) }  -  tn(q’) 


(l-t)n(p) ] . 


To  deal  with  n  ,  set  v.  =  -  log{l  -  F(x,.,)}  and 
n  l  (i ) 

d^  =  (n-i+l)( v^-v^  ^).  Then  the  v^  are  ordered  standard  exponential 

varieties  and  the  d^  are  independent  standard  exponential  variates.  We 

have  E(v. )  =  u.  =  £^  (n-J+l)  ^  =  -  log(l  -  — •  -)  +  0(n  for  i/n  bounded 

i  i  J  =1  n+1 

away  from  1.  Set  H(y)  =  F  ^(l-e  ^ )  and  expand  the  relation 


10  - 


x /  .  -  x,  .  ,  .  =  II (v.  )  -  H(  v.  .)  as  a  Taylor  series  about  u  to  get 
(l)  (l-l)  1  l-a  ■  1-i 


•  i  “  x/’-_Ll\  =  d. {(n-i+l)  1  H'(u.  ,)  +  (n-i+l)  ±  n"(u  ,)(v  -  u 

l)  (l+l)  i  i-l  i-l  1  —  i  i- 


1 


)} 


+  0  (n-2). 
P 


Take  expectations  to  get 


E{x(i)  -  x(i-l)'  =  (n'i+irl  H,(ui-1}  +  °(l 


then 


y.  =  d.  {l  +  (v.  ..  -  u.  )  H"  ( u.  )/H'(u.  )}  +  0  (n  *) 

l  l  i-l  i-l  i-l  i-l  p 


Thus 


nn(t: 


_l,  [  nt  ]  _1 

=  Z  d.  {l  +  (v.  -  u.  )  H"(u.  )/H'(u.  )}  -  nt  )  +  0  (n-*) 

.  ,  l  i-i  i~l  i-l  i-l  p 


i=l 


=  n*(t)  +  0  (n  '*)  . 
n  p 


Under  mild  conditions  n*  has  E{r)*(t)l  +  0  and  Cov{n*(t )  ,n*(  s )  }  ■+  p_(s,t). 

n  n  n  n  J 

A  martingale  central  limit  theorem  can  be  applied  to  prove  (under  somewhat 

more  stringent  conditions)  that  n*  converges  weakly  to  n  ■ 

This  falls  short  of  a  rigorous  proof  because  the  remainder  terms  are  not 

small  uniformly  in  t  .  Moreover  weak  convergence  of  does  not 

automatically  imply  conjecture  (iii).  Nevertheless  Monte  Carlo  results 

indicate  clearly  that  the  conjecture  is  true  for  the  normal »  logistic  and 

extreme  value  distributions. 

Furthermore,  in  the  case  p  >  0  ,  q  <  1  the  arguments  for  (i)  and 
(ii)  can  be  made  rigorous  for  these  distributions. 
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3.  COVARIANCE  FUNCTIONS  FOR  SPECIFIC  DISTRIBUTIONS. 


3.1  In  the  case  where  F(x)  =  1  -  e  X,  the  standard  exponential 
distribution,  the  values  .  .  >z(r)  are  distributed  exactly  as  the  order 

statistics  of  sample  of  size  r  from  a  uniform  distribution  on  (0,1).  The 
asymptotic  distribution  of  nn  is  then  that  of  3rownian  motion  and  Q,  R 
are  Brownian  bridges.  See  Seshaari ,  Csorgo  and  Stephens  (196?)  for  details. 


3.2 


When  F  is  uniform  we  have  m.  -  m.  „ 

l  l-l 


(n+1 )-1 


a  constant . 


711611  Z(i)  =  (x(k-i)  '  X(k))/(x(k+r+l)  "  x(k)}  S°  that  2(1)  •••  ” ( r )  are 
again  distributed  exactly  as  the  order  statistics  in  a  sample  of  r  uniform 
variates  on  (0,l).  Again  Q  and  R  are  Brownian  bridges. 


3.3  When  F  =  $  ,  the  standard  normal  distribution  with  density  4>  , 
we  find 

c(x)  =  (l-x)'J’  1(x)/(()($-1(x) ) 

I1(s)  =  [s  +  s($>-1(s))2  +  4>_1(s)4>($-1(s))]/2 

I2(s)  =  -[(s2+s)A  +  (s2-s)(<t_1(s))2/s  +  (2s-l)<?-1(s)4>(<J>-1(s))  A 
+  (s2-s)$"1(s))1*/f  +  (2s-1)($"1(s))3<K<T1(s))/U 
+  H"1(s))2<))2(<5>"1(s))/U]/2 
I^(s,t)  =  J(t)  -  J(s)  for  0  <  s  5  t  <  1 


J(t)  =  [($_1(t))2(l-t)  -  *-1(t )4>( <t>_1  (t ) )  -  t]/2 


where 


These  are  used  in  (t>)  and  (7)  to  give  p(s,t). 

3.  A  For  F(x)  =  exp{-  exp(-x)},  the  extreme  value  distribution,  wo 

c(x)  =  (log  x)  1  -  x  1  -  (x  log  x)  ] 


l1(s)  =  E1(-  log  s) 

I2(s)  =  -  E^(-  log  s)/2  +  (log  s  -  log(-  log  s))  E 1 (-  log  s)  -  s 


-1 

y 


—  y 

log(y)e  7dy 


-  log  s 


I3(s,t)  »  K(s)  -  K(t)  0  <  s  5  c  <  1 


where 


K(s)  ■  E^(-  log  s)  +  log(-  log(s))  +  log  s 


and 


E^y)  “  |  x  1e  Xdx  . 

y 


These  expressions  are  used  in  (6)  and  (7)  to  give  p(s,t).  The 
extreme  value  distribution  is  sometimes  written  in  the  form 
F*(x)  =  1  -  exp{-  exp(x)},  -  <*>  <  x  <  00  .  F*(x)  is  the  distribution  of 


where  x'  has  the  distribution  F(  •)  at  the  beginning  of  this  subsection 
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For  F*(x),  the  covariance  p*(s,t)  is  found  from  p(s,t)  for  F(x), 
by  the  relation  p*(s,t)  =  p(l-s,l-t). 

3.5  For  the  logistic  distribution,  F(x)  =  (l  +  e  )  ,  we  have 

c ( x)  =  (x-l)/x 

Ix(s)  =  -  s  -  log(l-s) 

s 

Ig(s)  =  s  -  s^/s  +  |  u  ^(l-u)log(l-u)du 
0 

I^(s,t)  =  t-s  +  log  s  -  log  t,  0  <  s  5  t  <  1  ; 


p(s,t)  is  again  calculated  from  (6)  and  (7). 


k.  DISTRIBUTIONS  AMD  PERCENTAGE  POINTS. 

'2 

Statistic  A  .  Using  the  covariance  functions  calculated  above  we  have 
found  the  eigenvalues  X.  of  n*  by  discretization  of  the  integral  involved 
in  (5).  That  is,  we  found  the  eigenvalues  X  , . . . , of  the  matrix  system 

k 

Af{i-*s)/k}  =  [  E  f{(j-ss)/k}p{(i-!2)/k,(j-!s)/k}  ]/k  . 

J=1 

In  these  calculations  k  =  100  is  adequate  (in  the  sense  that  further 
increases  of  k  do  not  significantly  change  the  critical  points).  Having 
found  the  A^  we  evaluated  the  critical  points  of  E^0^1  A  .ok  by  Imhof's 

method  (see  Durbin  and  Knott,  1972).  The  resulting  critical  points  are  in 
Table  1,  for  the  normal  and  logistic  distributions,  for  various  0  5  p  <  q  5  1. 
(Recall  that  p  =  k/n  and  q  =  (k  +  r  +  l)/n  ,  as  k,  r,  n  -*■  °°)  . 

In  the  case  of  symmetric  distributions  such  as  these,  censoring  at  p,  q 
leads  to  the  same  points  as  censoring  at  1-q,  1-p  so  that  the  tables 
are  quite  compact. 

An  interesting  fact  is  that  the  asymptotic  points  change  fairly 
slowly  with  the  censoring  pattern.  It  seems  possible  that  more  detailed 
study  would  suggest  a  useful  approximate  correction  factor  connecting  the 
p,  q  censored  case  to  the  uncensored  case;  in  any  case  interpolation  in 
the  tables  works  well. 

For  finite  samples,  Monte  Carlo  points  have  been  found  for 

the  normal  test,  and  for  uncensored  samples  of  sizes  n  =  20  and  for 

2 

n  *  1*0.  These  are  given  in  Table  2.  For  A  ,  the  points  for  finite 
n  converge  fairly  quickly  to  the  asymptotic  points;  we  observed  that  use  of 
the  exact  values  for  the  m^  (rather  than,  say,  Blom’s  approximation 
mi  at  $_1{(i-3/8)(n+lA)})  makes  the  convergence  faster. 
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Statistics  T  and  M  .  Statistics  T  and  M  are  asymptotically  normally 
distributed  with  mean  0  ,  and  variance  given  by  (9).  For  the  uncensored  case, 
these  have  been  worked  out  analytically.  For  the  normal  distribution, 

■  (1  -  3^/x)/8  =  0.056084  and  o^  “  3/16  =  0.1875  .  For  the  logistic 

1.  M 

2  2 

distribution  the  values  are  =  (it  -  9 )  / 1 2  =  0.07247  and 
2  2  2 

ow  -  1  -  it  / 12  +  (0.5  -  log  2)  »  0.21484.  These  statistics  also  converge 

M 

quickly  to  their  asymptotic  distributions,  as  one  would  expect. 

Thus,  to  make  a  test  for  normality  based  on  the  median,  we  calculate 
M*  *  {r/0.1875}^  ^z({r+i}/2  ~  if  r  is  °dd,  ar*d  refer  M*  to  a  standard 

normal  distribution;  if  r  is  even,  the  bracket  including  z  is  replaced  by 
^Z({r+l}/2)  ”  (r+2)/{2(r+l) }] .  For  the  test  based  on  the  mean, 

T*  =  {r/0.0561}^ (z  -  0.5)  is  refered  to  the  standard  normal  distribution. 

2 

Note  that  if  the  z^  were  ordered  uniforms,  the  variance  oT 

2 

would  be  1/12  ■  0.0833,  and  ow  would  be  0.25  .  The  true  variances  are 

M 

much  smaller  especially  in  the  normal  case. 

Some  calculations  have  also  been  made  when  the  test  is  for  the 

normal  or  the  logistic  distribution,  but  the  sample  tested  is  actually  uniform. 

For  the  logistic  test,  statistics  T  and  M  are  again  asymptotically  normal  with 

2  2 

mean  0  ,  and  the  variances  are  o_  *  3/70  *  0.04286  and  aw  *  0.3.  The  algebra 

T  M 

involved  in  the  calculations  is  extensive  and  will  be  published  elsewhere. 
Straightforward  calculations  then  show  that  the  asymptotic  power  of  T  ,  for  a 
5Z  test  against  a  uniform  alternative,  is  0.011,  that  is  statistic  T  is  both 
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inconsistent  and  biased.  For  M  the  asymptotic  power  is  0.097,  very  low, 
and  showing  that  M  is  not  consistent.  Similar  results  hold  for  T  and 
M  in  the  test  for  normality  against  the  uniform  alternative;  the 
asymptotic  power  of  M  is  0.11,  so  that  M  is  not  consistent,  and  that  of  T 
is  0.031,  so  that  T  is  biased  and  inconsistent.  The  Monte  Carlo  studies 
in  Section  6  below  verify  these  results. 
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5.  EXAMPLE. 

Example.  Table  3,  part  (a),  give  15  values  of  X  ,  a  measure  of  endurance 
of  industrial  specimens,  taken  from  Section  6.2  of  Biometrika  Tables  for 
Statisticians,  Vol.  2.  Graphical  plots  are  given  there  and  suggest  that 
the  logarithms  might  be  normally  distributed.  Also  given  in  Table  3  are 
the  values  x^,  the  logarithms  of  X^ ,  values  of  ,  the  normalized 

spacings  ,  and  the  values  z^.^  ,  together  with  the  values  of  the  test 

2  . 

statistics.  Reference  to  Table  1  shows  that  A  is  not  nearly  significant, 

so  that  lognormality  of  the  original  values  is  acceptable.  The  values  of 
M*  and  T*  (Section  4  above)  are  -0.958  and  -0.594  and  these  too 
are  not  significant. 

In  part  (b)  the  calculations  are  shown  for  a  censored  sample 

consisting  of  the  first  11  of  the  ordered  X  set;  again  normality  can  be 

accepted.  If  the  original  X  are  used  without  taking  logarithms,  values 
2 

of  A  are  7.424  for  the  whole  set,  and  3.262  for  the  censored  set. 
Reference  to  Table  1,  with  p  =  0  and  q  =  11/15  ,  shows  both  of  these 
to  be  significant  at  the  1%  level.  These  results  agree  with  results  of 
other  tests  described  in  Biometrika  Tables  for  Statisticians. 
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6.  POWER  COMPARISONS. 

In  this  section,  we  examine  the  power  of  the  tests  for 

normality.  Table  A  gives  the  results  of  Monte  Carlo  power  studies,  for  tests 

with  uncensored  samples.  The  tests  are  for  sample  sizes  n  =  20  and  n  =  AO, 

2 

and  the  test  level  is  5%.  The  test  statistics  compared  are  A  ,  M  and  T  , 

2 

against  the  well-known  Anderson-Darling  statistic  A  (Case  3)  and  the 

2 

Shapiro-Wilk  (1965)  statistic  W  .  In  A  (Case  3),  the  Anderson-Darling 

statistic  is  calculated  using  values  z,.N  =  G{x,..},  with  estimators  x  and 

°  (i)  (i) 

2  .  2 
s  for  the  normal  distribution  parameters  u  and  o  .  Critical  points  are 

given  by  Stephens  (197A). 

2  2 

The  power  studies  show  A  ,  A  (Case  3)  and  W  to  have  much  the 

2  2 
same  power  overall.  A  detects  skew  alternatives  better,  and  W  and  A  (Case  3) 

are  better  against  symmetric  alternatives.  M  and  T  are  poor  in  power  against 

symmetric  alternatives;  the  results  for  the  uniform  and  logistic  alternatives, 

for  example,  verify  the  asymptotic  results  of  Section  3,  that  T 

and  M  can  be  not  consistent  or  even  biased.  M  was  originally 

introduced  in  connection  with  tests  for  the  2-pararaeter  Weibull  distribution 

against  a  special  class  of  alternatives,  and  was  suggested  as  a  one-tailed 

test.  Here  we  have  a  wide  range  of  alternatives  and  M  and  T  have  both 

2 

been  used  as  two-tailed  tests.  Further  examination  of  M  ,  T  and  A  m 

connection  with  tests  for  the  Weibull  distribution  is  in  Lockhart,  O'Reilly 

2 

and  Stephens  (198A);  again  A  has  good  power. 
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Table  2 

2 

Percentage  points  for  A  for  complete  samples  of  size  n  from  a  normal 


distribution. 


Upper  tail  significance  level 


n 

0.25 

0.10 

0.05 

0.025 

.01 

20 

1.016 

1.521 

1.946 

2.345 

2.952 

AO 

0.980 

1.487 

1.887 

2.313 

2.832 

CO 

0.955 

1.422 

1.798 

2.191 

2.728 

*.  *. 
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Table  3 


Values  X  of  endurance  measurements  and  calculations  for  A  ,  T*  and  M*. 


Part  (a) 

Values  X(i) 

0.20 

0.33 

0.45 

0 

0.92 

0.95 

0.97 

1 

2.22 

2.275 

3.65 

7 

Values  x., .  \  : 

(.1 ) 

-1.609 

-1.109 

-0.799 

-0 

-0.034 

-0.051 

-0.030 

0 

0.798 

0.322 

1.295 

1 

m. 

l 

-0.335 

-1.243 

-0.04 8 

-0 

-0.335 

-0.165 

0.000 

0 

yi 

1.026 

1.033 

0.366 

2 

0.189 

0.126 

0.422 

2 

0.123 

2.031 

2.169 

0 

Z(i) 

0.066 

0.132 

0.156 

0 

0.376 

0.3S5 

0.412 

0 

0.700 

0.331 

D.?”0 

A2  =  0. 

375 

Median 

z(?)  =  0.385 

M*  - 

(13/0.1875)* 

(0.385 

Mean  z 

=  0.461 

T*  - 

(13/0.0561)* 

(0.461 

Part  (b)  z^  = 

0.095 

0.191 

0.225 

0 

0.544 

0.555 

0.595 

0 

II 

0.6067 

Median 

z(5)  =  °*^2^ 

M*  - 

(9/0.1875)* 

(0.526 

Mean  z 

=  0.449 

T*  - 

(9/0.0561)* 

(0.449 
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Table  4 

Power  comparisons:  Tests  of  normality.  Test  Level  =  5%.  The  table  gives  the 


percentage  of  5000  Monte  Carlo  samples  declared  significant  by  the  appropriate 
statistics . 

n  =  20 


Alternative 


A“  (Case  3) 


Shapi ro 
Wilk. 


X2  1  d.f. 

98 

99 

‘-9 

96 

98 

X2  2  d.f. 

78 

83 

37 

•oj 

S3 

X2  3  d.f. 

59 

63 

69 

j  - 

68 

X2  A  d.f. 

50 

54 

58 

5  S 

X2  10  d.f. 

24 

24 

26 

-0 

29 

Exponential 

79 

83 

87 

i  <~ 

82 

Log  Normal 

93 

94 

96 

/>;• 

95 

Uniform 

20 

">  n 

14 

4 

Logistic 

11 

10 

9 

6 

10 

Laplace 

30 

25 

2  0 

10 

20 

C1 

90 

88 

84 

50 

67 

C2 

53 

51 

*  7 

£4 

40 

c3 

34 

34 

28 

5 

25 

C4 

26 

26 

21 

7 

21 

n  =  40 

X2  1  d.f. 

100 

100 

100 

100 

100 

X2  2  d.f. 

98 

100 

100 

9o 

99 

X2  3  d.f. 

97 

93 

97 

56 

96 

X2  «  d.f. 

82 

89 

Q  2 

90 

X2  10  d.f. 

39 

44 

50 

53 

Exponential 

99 

99 

100 

99 

Log  Normal 

100 

100 

100 

ao 

100 

Uniform 

46 

62 

43 

4 

Logis  tic 

14 

12 

12 

o 

13 

Laplace 

50 

42 

36 

10 

21 

t. 

100 

99 

98 

.  u2 

76 
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